Oo-line Cursive Handwriting Recognition Using Recurrent Neural Networks 2 Handwriting Recognition 10 2.1 a Taxonomy of Handwriting Recognition Problems

نویسنده

  • Andrew William Senior
چکیده

Computer handwriting recognition ooers a new way of improving the human-computer interface and of enabling computers to read and process the many handwritten documents that must currently be processed manually. This thesis describes the design of a system that can transcribe handwritten documents. First, a review of the aims and applications of computer handwriting recognition is presented, followed by a description of relevant psychological research. Previous researchers' approaches to the problems of oo-line handwriting recognition are then described. A complete system for automatic, oo-line recognition of handwriting is then detailed, which takes word images scanned from a handwritten page and produces word-level output. Methods for the normalization and representation of handwritten words are described, including a novel technique for detecting stroke-like features. Three probability estimation techniques are described, and their application to handwriting recognition investigated. The method of combining the probability estimates to choose the most likely word is described, and performance improvements are made by modelling the lengths of letters and the frequency of words in the corpus. The system is tested on a database of transcripts from a corpus of modern English and recognition results are shown. Recognition is described both with the search constrained to a xed vocabulary and with an unlimited vocabulary. The nal chapter summarizes the system and highlights the advances made before assessing where future work is most likely to bring about improvements. Declaration This thesis describes research carried out at Cambridge University Engineering Department between October 1991 and September 1994. It is the result of my own work and contains no work done in collaboration. The length of this thesis, including references and gure captions, is thirty-seven thousand words. Acknowledgements First of all, I would like to express my gratitude to the late Professor Frank Fallside, for supervising me during the rst half of this thesis and for providing the original inspiration for this work. I am also indebted to Dr Tony Robinson who has supervised me admirably for the latter half of this thesis with enthusiastic guidance and support, particularly in the last few weeks. I would like to thank everyone else in the Speech, Vision and Robotics group which Frank Fallside created. The group has been an ideal environment , both socially and technically, in which to conduct research. Those in the group who have helped in the creation of this thesis are too numerous to mention individually. Special thanks must …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Online Cursive Handwriting Mongolia Words Recognition with Recurrent Neural Networks

This paper primarily discussed Online Handwriting Recognition methods for Mongolia words which being often used among the Mongolia people in the North China. Because of the characteristic of the whole body of the Mongolia words, namely connectivity between the characters, thereby the segmentation of Mongolia words is very difficult. We introduced a recurrent neural network to online handwriting...

متن کامل

Offline Arabic Handwriting Recognition with Multidimensional Recurrent Neural Networks

Offline handwriting recognition is usually performed by first extracting a sequence of features from the image, then using either a hidden Markov model (HMM) [9] or an HMM / neural network hybrid [10] to transcribe the features. However a system trained directly on pixel data has several potential advantages. One is that defining input features suitable for an HMM requires considerable time and...

متن کامل

Enhancing Neural Confidence-based Segmentation for Cursive Handwriting Recognition

This paper proposes some directions for enhancing a neural network-based technique for automatically segmenting cursive handwriting. The technique fuses confidence values obtained from left and center character recognition outputs in addition to a Segmentation Point Validation output. Specifically, this paper describes the use of a recently proposed feature extraction technique (Modified Direct...

متن کامل

Fusion of Segmentation Strategies for Off-Line Cursive Handwriting Recognition

Cursive handwriting recognition is a challenging task for many real-world applications such as document authentication, form processing, postal address recognition, reading machines for the blind, bank check recognition, and interpretation of historical documents. Therefore, in the last few decades, researchers have put an enormous effort into developing various techniques for handwriting recog...

متن کامل

An Investigation of Context-dependent and Hybrid Modeling Techniques for Very Large Vocabulary On-line Cursive Handwriting Recognition

This paper addresses a very challenging topic in on-line handwriting recognition. It deals with the problem how to further improve a baseline very large vocabulary HMM-based handwriting recognition system with a vocabulary size of 200.000 German words. The use of sophisticated HMM-technology allows the construction of such a baseline system. It is however an extremely difficult task to further ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994